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ABSTRACT 

Friedreich's ataxia (FRDA) is a common hereditary 
degenerative neuro-muscular disorder caused by 
expansions of the (GAA)n repeat in the first intron 
of the frataxin gene. The expanded repeats from 
parents frequently undergo further significant 
length changes as they are passed on to progeny. 
Expanded repeats also show an age-dependent in- 
stability in somatic cells, albeit on a smaller scale 
than during intergenerational transmissions. Here 
we studied the effects of (GAA)n repeats of varying 
lengths and orientations on the episomal DNA rep- 
lication in mammalian cells. We have recently shown 
that the very first round of the transfected DNA rep- 
lication occurs in the lack of the mature chromatin, 
does not depend on the episomal replication origin 
and initiates at multiple single-stranded regions of 
plasmid DNA. We now found that expanded GAA 
repeats severely block this first replication round 
post plasmid transfection, while the subsequent 
replication cycles are only mildly affected. The fact 
that GAA repeats affect various replication modes in 
a different way might shed light on their differential 
expansions characteristic for FRDA. 

INTRODUCTION 

Friedreich's ataxia (FRDA) is a common progressive 
neurodegenerative disorder, resulting from a decrease in 
the level of frataxin, a protein involved in iron homeosta- 
sis in mitochondria (1,2). In most FRDA cases, this loss of 
function is caused by expansions of the GAA repeat 
situated in the first intron of the frataxin gene, FXN (3). 
The expanded GAA repeat interferes with the transcrip- 
tion of the FXN gene, likely by changing its DNA or 
chromatin structure (4-7). There is a correlation between 



the length of the GAA repeat and the severity of FRDA 
symptoms (8,9): normal individuals carry <30 repeated 
GAA triplets in their FXN alleles, carriers have 
pre-mutation alleles with 30 to 60 repeats (10); meanwhile, 
full-mutation alleles with 60 to 1700 repeats are typical of 
FRDA patients (2,11). 

The full- and pre-mutation FXN alleles are quite prone 
to large-scale changes in the number of repeats during the 
gametogenesis and first post-zygotic divisions. It is 
believed that expansions of the GAA repeat occur 
largely during early post-zygotic divisions. A repeat in- 
herited from the mother could be contracted or 
expanded with equal probabilities and its propensity to 
expand increases with mother's age (12). A repeat in- 
herited from the father is always shorter in progeny, as 
it massively contracts during spermatogenesis (12,13). In 
fact, it ends up being longer in the progeny than it was in 
the sperm, since it expands during early embryonic div- 
isions (12,14). The combined probability of expansions 
and contractions during the intergenerational transmis- 
sions is very high, ~85% (12). Another type of instability, 
which also contributes to the FRDA pathology, occurs in 
somatic cells (15,16). The degree of somatic instability 
differs between human tissues; the highest rate of expan- 
sions is observed in dorsal root ganglia, the most affected 
tissue in FRDA patients (16). 

The high instability of the GAA repeat during gameto- 
genesis and early embryogenesis could be due to replica- 
tion errors as the fork tries to progress through repetitive 
runs. In a multicopy yeast plasmid, expanded GAA 
repeats caused replication fork stalling (17). This stalling 
only occurred in one orientation of the repeat when the 
GAA run was situated in the lagging strand template. 
Coincidentally, the repeat instability was more 
pronounced in this same orientation. Similar observations 
were also made for repeats in bacterial plasmids (18). 
When GAA repeats were integrated into a yeast chromo- 
some, fork stalling remained orientation-dependent 
(19,20), but the rates of repeat expansions became 
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orientation-independent (19,20). At the same time, genetic 
screening revealed the genes encoding the replication fork 
components as the key players in repeat expansions in this 
system, which indicated that replication errors could occur 
in either orientation of the repeat (19,20). These replica- 
tion problems are likely due to the unusual structural 
properties of GAA repeats: they can form DNA triplexes 
(21), hairpin-like structures (22) and parallel duplexes (23). 
The orientation dependence of fork stalling at the 
repeat is highly indicative of triplex formation (24,25). 
The question remains, however, whether unusual DNA 
structures of the repeat can be formed prior to or during 
DNA replication in the context of complex mammalian 
chromatin. 

We have recently shown that the first replication cycle 
of an episome transfected into mammalian cells goes dif- 
ferently from all the subsequent replication rounds, most 
likely due to differences in template chromatin structure 
(26). This first replication cycle appears to initiate at 
various positions in the episome and depends on the 
presence of single-stranded DNA regions. Furthermore, 
it could take place in the late Gl phase of the cell cycle 
or even in non-dividing cells, implicating this replication 
mode in DNA repair. 

Here we show that carrier-size GAA repeats block the 
progression of the first episomal replication cycle in mam- 
malian cells. This blockage is much stronger than that 
previously observed in yeast and bacteria — in fact, it is 
almost complete. In contrast, subsequent episomal repli- 
cation cycles were not affected by those repeats and were 
only mildly inhibited by long, disease-size repeats. This 
shows that expanded GAA repeats have different effects 
on different modes of DNA replication. 

MATERIALS AND METHODS 

Plasmids 

The pUCneoGAAn and pUCneoCTTn plasmids were 
obtained by inserting the repeat-containing EcoRI 
EcoRI fragments of pYES-GAAlO, pYES-GAA20, 
pYES-GAA35, pYES-GAA57, pYES-GAA114 and 
pYES-GAA230 (17) into the blunt-ended Aatll site of 
pUCneo (26) in two orientations. The plasmids were 
named according to the leading strand template. 

Ori(-)GAA57 and Ori(-)CTT57 were obtained by in- 
serting the repeat-containing EcoRI-EcoRI fragment of 
the pYES-GAA57 (17) into the blunt-ended Aatll site of 
Ori(— ) (26) in two orientations. 

The pRepControl plasmid was obtained by inserting the 
Hindlll-Eco911 fragment of the pRep4 (Invitrogen), con- 
taining both OriP and the EBNA-1 gene, into the Apal- 
Ndel-digested pUCneo. RepGAA230 and RepCTT230 
were obtained by inserting the repeat-containing EcoRI 
EcoRI fragment of pYES-GAA230 into the Aatll site of 
pRep-control. 

Detection of alternative DNA conformations of the repeat 
using SI nuclease 

One microgram of Ori(-)GAA57 plasmid was digested 
with Nt.BstNBI nicking endonuclease (New England 



Biolabs) according to the manufacture's protocol, 
ethanol-precipitated, dissolved in TE (10 mM Tris-HCl, 
ImM EDTA) and then heated at 70° C for lOmin 
to remove possible alternative DNA structures. Nicked 
Ori(-)GAA57, as well as a control supercoiled Ori(-) 
GAA57 plasmids, were digested with 4 units of the SI 
nuclease (Fermentas) for lOmin at room temperature. 
The reactions were stopped with 10 mM EDTA, DNA 
was precipitated, dissolved in TE and heated again at 
70° C for lOmin. At this step, all potential alternative 
structures were eliminated from the control supercoiled 
Ori(-)GAA57 plasmid as it was converted into an open 
circular from by the nuclease. Both samples were then 
digested with Seal and Ndel restriction enzymes, and 
the 957-bp restriction fragments were eluted from 
agarose, mixed with 1 irg of pUCneo plasmid and 
digested with 4 units of SI nuclease for lOmin at room 
temperature. At this step, SI nuclease cleaved across the 
single-stranded breaks that were introduced in the first 
round of the SI nuclease digestion. Both samples were 
then analyzed on 1% agarose gel to reveal double- 
stranded breaks at expected triplex structures. 

Cell cultures and transient transfections 

All cell lines (COS-1 monkey fibroblasts, 293 A embryonic 
kidney fibroblasts, 293-EBNA, and HeLa cells) were 
grown in Dulbecco's modified Eagle medium (DMEM) 
supplemented with 10% fetal bovine serum. 

COS-1 and 293 A cells were transfected using lipofectin 
(Invitrogen), while 293-EBNA cells (Invitrogen) were 
transfected using turbofect (Fermentas) according to the 
manufacturer's instructions. 

Two-dimensional separation of replication intermediates 

Isolation of replication intermediates from mammalian 
cells and their analysis via two-dimensional (2D) 
neutral/neutral gel electrophoresis were performed as 
described in ref. 26. 

RESULTS 

Replication of an SV40 origin-based plasmid is affected 
by GAA repeats in a length- dependent and orientation- 
independent manner 

We analyzed the progression of DNA replication 
through GAA repeats of varying lengths inserted into a 
pSV2neo-derived plasmid, pUCneo (Figure 1A). The 
resultant plasmids were introduced into T-antigen- 
expressing COS-1 monkey fibroblasts by transient trans- 
fection. The repeats were positioned on the path of 
the counterclockwise replication fork initiated at the 
episomal SV40 origin. We carefully controlled the quality 
of DNA samples used for transfection to assure that they 
mainly contained monomelic plasmids. Thirty hours after 
transfection into mammalian cells, the plasmids and their 
replication intermediates were isolated. The replication 
intermediates were digested with the Afilll restric- 
tion endonuclease, which generated a fragment with a 
GAA repeat positioned at approximately one-third of a 
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Figure 1. (A) The scheme of pUCneoGAA/CTTn plasmid. (GAA) n 
repeat was inserted in pUCneo in two orientations. Replication inter- 
mediates were digested by Afllll. The hybridization probe used in 2D 
gel analysis corresponded to the GAA repeat-containing Afllll 
fragment. (B) Schematic presentation of the 2D gel electrophoresis of 
mammalian replication intermediates digested with Afllll. 
Corresponding shapes of replication intermediates are shown in gray. 
Spot 0 - unreplicated Afllll fragment; spot 1 - replication stalling of 
the counterclockwise replication fork at GAA; spot 2 - replication 
stalling of the clockwise replication fork at GAA; spot 3 - two repli- 
cation forks from opposite ends of the fragment stalled at the repeat. 
May also contain intermolecular complexes of two different fragments 
joined at the GAA repeat. A short stretch of the bubble arc was 
omitted from this scheme because it is obscured by unreplicated 
DNA spot in our 2D gel pictures. 



distance from its end. The replication intermediated were 
then analyzed by 2D agarose gel electrophoresis followed 
by Southern hybridization (Figure 2). 

We initially expected that replication stalling at the 
repeat would result in a 2D pattern consisting of a Y arc 
with a single bulge (Figure IB, spot 1), positioned about 
one-third of the way from the unreplicated DNA spot 
(Figure IB, spot 0), as was observed in yeast (17). 
Unexpectedly, however, the presence of long-normal 
repeat (GAA) 37 , pre-mutation length (GAA) 57 repeat or 
mutation length (GAA) 114 repeat in the plasmid led to the 
appearance of two bulges, positioned approximately 
one-third and two-thirds of the way from the unreplicated 
spot (Figure IB, spots 1 and 2). In the presence of the 
(GAA) 37 repeat, there was still a considerable amount of 
the descending part of the Y arc, which indicates that most 
of replication forks still progress past the GAA repeat. 
However, for the (GAA) 57 and (GAA) n 4, the descending 
part was very faint suggesting that progression of most of 
the replication forks was blocked by GAA repeat. Since 
the length of (GAA) 114 is more than 300 bp, replication 
stalling takes the form of a darker region of the arc rather 
than an isolated spot as was observed for (GAA) 37 . We 
believe that most of the forks were blocked by a 
non-canonical DNA structure formed within the 
expanded GAA repeat; more stalling was observed at 
longer repeats where the structure was more likely to 
form. Differently from our previous observations in 
yeast, however, we did not see a significant difference in 
replication stalling between the two orientations of the 
repeat within the replicon. 
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Figure 2. Analysis of pUCneoGAAn/CTTn replication in COS-1 cells by 2D gel electrophoresis. Replication intermediates were digested by 
Afllll restriction endonuclease as shown in Figure 1. GAAn and CTTn correspond to two orientations of a GAA repeat relative to the SV40 
replication origin. Plasmids' replication was blocked in a length-dependent and orientation-independent manner. The spots 1, 2 and 3, corresponding 
to Figure IB, are indicated. 
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Other unexpected elements that were never observed in 
yeast were two spikes, corresponding to double Y arcs, 
extending from each bulge. These double Y arcs connected 
the bulges with spot 3 at the upmost position of the 
vertical line, corresponding to the X-shaped recombin- 
ation products. The appearance of a double Y arc extend- 
ing from the bulge could be explained by the blockage of 
the replication fork, which reached the GAA repeat first, 
followed by the completion of the replication by another 
fork that entered from the opposite end of the fragment. 
In this case, spot 3 would correspond to the point where 
two forks coming from the opposite sides meet at the 
repeat. This pattern is known to occur when the replica- 
tion fork progression is not just slowed down, but rather 
completely blocked by the repeat (27). Spot 3 may also 
contain complexes of two different Afllll fragments from 
different DNA molecules joined at GAA repeat. 

This logic can easily explain the spot 1 as a result of 
blockage of the counterclockwise replication fork until the 
clockwise replication fork arrives. The explanation for the 
second stalling spot at the Y arc (spot 2) is less straight- 
forward. The location of the spot 2 at two-thirds of the 
way from the unreplicated fragment gave us a hint that it 
may result from blockage of a replication fork that ap- 
proached the repeat from the opposite side of the 
fragment relative to the SV40 origin. In this scenario, 
the distance that the opposing replication fork has to 
run to reach the repeat would be exactly two-thirds of 
the fragment's length. 

To confirm this idea, we digested the same replication 
intermediates with the Ncol and Bsal restriction enzymes, 
positioning the GAA run in the center of the fragment 
(Figure 3A). In this case, forks entering the fragment 
from its opposite ends should both reach the repeat at 
the middle of the fragment, i.e. the two bulges (1 and 2) 
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Figure 3. 2D gel electrophoresis of replication intermediates digested 
so that the (GAA) 35 repeat was in the middle of the fragment. (A) The 
scheme of the digest of pUCneoGAA/CTT35 by Bsal and Ncol. (B) 
2D gel electrophoresis of replication intermediates of pUCneo digested 
with Bsal and Ncol. A single bulge (combined spot 1, and spot 2, 
Figure IB) was observed upon Bsal-Ncol digest. A position of the 
spot 3 is also shown. We concluded that replication approached the 
GAA repeat from both ends of the fragment. 



should now convene at the fragment's center (Figure 3A). 
This turned out to be the case: one can clearly see a unique 
bulge in the middle of the arc, with a single spike 
emanating from it. We believe that the spikes connecting 
spots 1 and 2 to spot 3 result from replication forks 
entering the fragment from both ends: one of the forks 
is stalled, and another approaches from the opposite dir- 
ection (Figure 3B). 

While this was gratifying, we were still left with follow- 
ing question. It was easy to assume that the counterclock- 
wise replication fork could be blocked to the extent that 
the clockwise replication fork gets enough time to go 
almost a full circle until it meets its counterpart at the 
repeat (spot 1 and the spike). For the spot 2, a similar 
explanation would imply that the clockwise replication 
fork somehow managed to reach the repeat and get 
ahead of the counterclockwise fork. This does not seem 
possible if the two forks start at SV40 origin at the same 
time, since clockwise fork has almost a full circle to run, 
while the counterclockwise fork starts very close to the 
repeat. One possibility could be that the clockwise and 
counterclockwise replication forks are not synchronized 
in our episome, and the clockwise replication fork some- 
times starts before the counterclockwise. To exclude this, 
we digested the replication intermediates of pUCneo 
plasmid at the Seal site that is located just upstream of 
the origin (Figure 4A). Under the conditions of this digest, 
the clockwise replication fork that starts independently 
from the counterclockwise, and has enough time to go 
almost a full circle before the counterclockwise fork 
starts, would generate a bubble arc (Supplementary 
Figure SI A and SIB). We did not observe a bubble arc, 
but instead we observed an asymmetric double Y that 
could account for two replication forks starting simultan- 
eously (Figure 4B, left, Supplementary Figure SI A and 
SIB). A very similar asymmetric double Y was described 
in (28). We also observed the Y arc that could potentially 
originate from dimers. To confirm that the absence of the 
bubble arc is not an artifact of our analysis, we also 
digested our replication intermediates with the 
Mva 12691 enzyme cleaving episomal DNA exactly 
opposite to the origin. Under this digest, the bubble arc 
was clearly visible (Figure 4B, right). 

Another possibility was that the spot 2 (Figure IB) 
originated from the small amount of dimers in the 
sample. In this case, the replication fork that initiated at 
the second origin in a dimer can approach the repeat from 
the other end. To test if this was the case, we purified 
monomeric pUCneoGAA/CTT57 episomes from an 
agarose gel, and analyzed their replication in mammalian 
cells (Figure 4C). We observed the same pattern with two 
stalling sites and two spikes, thus ruling out that the 
second spot was the product of dimeric episomes' replica- 
tion (Figure 4C). 

An inability to explain the appearance of the second 
stalling site within the framework of the bidirectional, 
SV40 ori-initiated replication prompted us to analyze the 
replication mode of the pUCneo episome in more depth. 
The results of this analysis have been published separately 
(26). Rather unexpectedly, we found that replication of 
this episome could be initiated at numerous positions 
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Figure 4. (A) The scheme of digest of pUCneo plasmid used to explore whether replication of this plasmid can also occur unidirectionally. (B) 
Replication intermediates of pUCneo were isolated from COS-1 cells 23 h post transfection and digested by Seal (left) and Mval269I (right). The 
pattern of Seal digest consists of a Y arc and an asymmetrical double arc. No bubble arc is visible. The control Mval269I pattern consists of a Y 
and a bubble arc. The hybridization probe was the same as in all other assays: AflHI-fragment of pUCneo that contained the ampicillin resistant 
gene. (C) Replication of monomer pUCneoGAA/CTT57 plasmids in COS-1 cells. Replication intermediates were isolated 24 h post-transfection, and 
digested with Afllll. 



throughout its sequence soon after transfection into mam- 
malian cells. We confirmed that the replication intermedi- 
ates that we observed are not transferred over from 
bacteria, but indeed originated from mammalian cells. 
The alternative replication mode did not rely on the 
SV40 origin or T-antigen for its initiation but rather 
initiated in the absence of a regular chromatin structure, 
and was stimulated by single-stranded DNA regions (26). 

It should be noted that for our plasmids, the alternative 
replication mode apparently co-existed with the conven- 
tional replication mode initiated at the SV40 origin. Thus, 
the replication pattern shown in Figure IB likely resulted 
from a superposition of replication intermediates 
generated in two different replication modes: the 
SV40-driven and the alternative one. The relative contri- 
bution to the episomal replication from those two modes 
depended on how much DNA was used in transfection, 
and how long it was inside the cell prior to replication 
intermediates isolation. For example, analyzing replica- 
tion of gel-purified monomers (Figure 4C), we observed 
a decrease in Y arc intensity after spot 1, which may be 
explained by less efficient SV40-driven replication mode in 
this particular experiment. 

The first replication cycle of the SV40-based episome is 
blocked by the GAA repeat, while subsequent replication 
cycles are not affected 

Our previous analysis of the alternative replication mode 
revealed that it is limited to the first replication cycle after 
episomal transfection into mammalian cells (26). We 
believe that the pattern of stalling at GAA repeat that 
we observed (Figures 2-5) was a superposition of the 
first replication cycle that initiated randomly throughout 
the sequence, and SV40 origin-initiated replication. Since 



the first cycle replication initiated everywhere, it mostly 
produced Y arc, and some of the bubble arc that we 
could only observe at overexposed pictures (Figure 5A, 
overexposed panel). During the first replication cycle, rep- 
lication intermediates consist of the template DNA 
strands that came from Escherichia coli plasmid and 
nascent strands synthesized in mammalian cells. In subse- 
quent replication cycles, both nascent and template 
strands are of mammalian origin. This allows one to dis- 
tinguish between the first and the subsequent replication 
cycles using a Dpnl restriction digest. This enzyme cleaves 
GATC sequences when they are methylated or 
hemimethylated by the bacterial Dam methylase. Thus, 
the Dpnl digest eliminates the non-replicated episomal 
DNA, as well as the products of its first replication cycle 
in mammalian cells. 

Figure 5A, right panel, shows the pattern of replication 
intermediates that were extensively treated by Dpnl prior 
to their electrophoretic separation. Remarkably, after this 
treatment the replication arc became smooth, while the 
two bulges, most of the spot 3, and the double Y spike 
disappeared, indicating that later rounds of replication 
were not significantly affected by the presence of GAA 
repeats. Even the longest repeat, (GAA) 2 3o, failed to 
stall the fork during the late, T-antigen driven rounds of 
the episomal replication (Figure 5B). We concluded, there- 
fore, that only the very first round of episomal replication 
was strongly inhibited by GAA repeats. 

This conclusion was independently confirmed by the 
analysis of replication through the (GAA) 57 repeat at dif- 
ferent time points after cell transfection. For these experi- 
ments, we used 0.5 ug of the episomal DNA per 
transfection, i.e. 40-fold less than the amount of DNA 
used in the experiments shown in Figure 2, in order to 
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Figure 5. (A) Replication stalling was no longer observed upon Dpnl digest of replication intermediates. 2D gel electrophoresis of the replication 
intermediates of pUCneoGAA57 isolated from the COS-1 cells and digested with either Afllll alone, or Afllll and Dpnl. To eliminate 
semimethylated products of the first replication cycle, the digest was performed with 20 units of Dpnl, and 10 units of Afllll for about 2h. 
Bubble arc can be observed on the overexposed panel of pUCneoGAA57 digested with Afllll (shown by an arrow). (B) An expanded (GAA) 2 3o 
repeat did not cause replication stalling of the T-antigen-driven replication. 2D gel electrophoresis of the replication intermediates of pUCneoGAA/ 
CTT230 isolated from COS-1 cells and digested with Afllll and Dpnl. (C) Replication stalling at the GAA repeat decreased over time. 2D gel 
electrophoresis of replication intermediates of the pUCneoGAA57 plasmid isolated at 2, 12, 18, 24 and 30 h after COS-1 cells transfection. Only 
gel-purified monomer plasmid was used for transfection. Replication intermediates were digested with Afllll. The descending part of the Y arc 
increased, while the bulges and the double Y arc decreased with incubation time. 



promote its rapid utilization by the mammalian replica- 
tion machinery. Figure 5C shows that there is a gradual 
decrease in the strength of the fork stalling over time: 
while the Y arc does not proceed beyond the repeat 
during the first 12 h post-transfection, its progress, as 
illustrated by the appearance of the descending arm, 
becomes evident after 18 h, while the descending part of 
the arc reaches the same intensity as the ascending part 
30 h post-transfection, at which point the spikes and fork 
stalling drastically decrease. 

To explore whether replication stalling occurred in the 
course of the alternative replication, we analyzed the rep- 
lication of the Ori(-)GAA57 plasmid, which did not 
contain the SV40 replication origin (Figure 6A). 
One can see that in COS-1 cells this plasmid was still 
capable of undergoing some replication, which was 
characterized by the profound fork stalling at the GAA 
repeat (Figure 6B). The fact that the descending portion of 
Y arc beyond the spot 2 was not detectable in the Ori(-) 
plasmids is consistent with the complete replication 
blockage at GAA repeat. 

We have further confirmed that this first round of al- 
ternative replication does not require T-antigen for its ini- 
tiation. Replication fork stalling at the (GAA) 57 repeat 
was observed upon episome transfection into the human 
293A cell line, which does not express viral T-antigen 
(Figure 6E). We concluded that profound fork stalling 
at GAA repeats in mammalian cells is only evident 
during replication in early post-transfection stages (26). 

Summarizing, we found that an alternative replication 
mode for the episome is blocked near completely by the 
GAA repeat. We cannot, however, deduce from our data 
whether the very first round of the T-antigen-driven rep- 
lication is blocked by the repeat as well. 



Blockage of the first replication cycle could be caused by 
non-B DNA structures pre-existing in episomal DNA, as 
well as the structures that formed inside mammalian cells 

The first replication cycle of a transfected plasmid could 
be blocked by a DNA structure that either was formed by 
the repeat inside a mammalian cell or pre-existed in the 
plasmid used for transfection. Magnesium ions, which are 
normally present in the transfection media, could promote 
the formation of triplex by the GAA repeat (21). To de- 
termine whether the alternative replication can be affected 
by DNA structures that form inside the cell, we eliminated 
the pre-existing structures and tested whether stalling at 
the GAA repeat would still occur. We capitalized on the 
fact that triplex or sticky DNA structures are readily 
formed in superhelical plasmids in presence of magnesium 
ions or under acidic pH, but are extremely unfavorable in 
open circular or linear DNA (29,30). First, we used the 
plasmid sample isolated by alkali lysis from bacteria and 
dissolved in TE pH 8.5, the conditions in which triplex 
structure does not form. Second, we introduced 
single-stranded breaks by Nt.BstNBI nicking enzyme, 
and additionally heated the sample for 20min at 70° C 
in the presence of EDTA to get rid of all potential alter- 
native DNA structures. Nt.BstNBI introduced six 
single-stranded breaks scattered around the plasmid 
(Figure 6 A, shown by circles). To confirm that our 
nicked plasmid sample does not contain alternative 
DNA conformations, we employed the SI nuclease, 
which cleaves DNA at single-stranded regions existing in 
intramolecular triplexes or other alternative DNA struc- 
tures. The supercoiled GAA-containing plasmid was used 
as a control, since it is known to form triplex structure 
under acidic pH in SI nuclease buffer (25). We treated 
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Figure 6. (A) The scheme of the Ori(-)GAA/CTT57 plasmid. In the 
Ori(-) plasmid, the SV40 replication origin was replaced by the 
nuclear localization sequence (26). (B) 2D gel electrophoresis analysis 
of replication intermediates of Ori(-)GAA/CTT57 and Ori(-), isolated 
from Cos-1 cells. Replication of Ori(-) plasmid was blocked by the 
(GAA) 57 repeat in both orientations. (C) 2D gel electrophoresis 
analysis of replication intermediates of nicked Ori(-)GAA57 plasmid 
isolated from Cos-1 cells. Replication of the nicked plasmid that did 
not contain preformed structures was still affected by GAA repeat. The 
positions of nicks introduced by Nt. BstNBI are shown by small circles 
in Figure 6A. We concluded that the GAA repeat forms an alternative 
structure that stalls replication inside the mammalian cell. (D) SI 
nuclease digest of nicked and supercoiled Ori(-)GAA57 plasmids to 
confirm that alternative DNA structures do not form in the nicked 
Ori(-)GAA57. 1 - series of bands corresponding to the GAA repeat 
position that are visible only for the supercoiled sample; 2 - 957 bp 
Scal-Ndel fragment; 3 - supercoiled pUCneo, 4 - open circle pUCneo. 
(E) 2D gel of the replication intermediates of pUCneoGAA57 isolated 
from 293A cells. Alternative replication of pUCneoGAA57 happens in 
the absence of T-antigen and is severely stalled by the GAA repeat. The 
stalling points are indicated by arrows. 



nicked and supercoiled Ori(-)GAA57 with the SI nuclease, 
followed by the digestion with the Seal and Ndel restric- 
tion enzymes and purification of the 957-bp fragment con- 
taining (GAA) 57 -repeat from an agarose gel (Figure 6D, 
band 2). This protocol allowed us to analyze only the 
single-stranded breaks generated by the SI nuclease in 
the repeat-containing DNA fragment. (We deliberately 
limited our analysis to this Scal-Ndel fragment to avoid 
interference with DNA fragments generated upon the SI 
digestion opposite to the single-stranded breaks made by 
Nt. BstNBI.) We then subjected the eluted fragments to yet 
another round of the SI digest to convert single-stranded 
breaks into the double-stranded ones. A supercoiled 
pUCneo plasmid was added to the samples to confirm 
the SI nuclease activity that converted the supercoiled 
pUCneo (Figure 6D, band 3) into the open circle 
(Figure 6D, band 4). Subsequent gel-electrophoretic 
analysis showed multiple double-stranded breaks at the 
position of GAA repeat in the supercoiled but not in the 
nicked plasmid (Figure 6D, band 1). These data confirm 



that nicked plasmid in our transfection experiments did 
not contain pre-formed alternative structures. 

When we transfected Cos-1 cells with Ori(-)GAA57 
plasmid, in which the alternative DNA structures of the 
GAA repeat were eliminated by nicking and heating, fork 
stalling at the repeat was still observed. Note, however, 
that both the replication arc and stall sites became less 
pronounced than in the case of superhelical DNA 
(Figure 6C). We believe that this decrease in replication 
efficiency was due to the fact that initiation of the alter- 
native replication depends on DNA supercoiling (26). 
Thus, a nick should be repaired and supercoiling 
introduced in order for the transfected plasmid to start 
replicating. No supercoiled plasmid was visible in the 
initial sample that was used in transfection 
(Supplementary Figure S2), so the arc that we observed 
could not be a result of a preformed structure in the mol- 
ecules that escaped nicking. Fork stalling observed in the 
last experiment must, therefore, be associated with a DNA 
structure that was formed inside a mammalian cell. 
Decreased efficiency of the replication stalling might be 
due to the fact that some of the plasmids got 
chromatinized even before the nicks were sealed, making 
triplex formation impossible. 

We conclude that while a significant fraction of 
repeat-mediated stalling observed during the first replica- 
tion round is likely due to pre-existing non-canonical 
DNA structures formed in the transfection media, struc- 
tures formed inside the cell impede the replication fork 
progression as well. 

The initiation of alternative replication is stimulated by 
the presence of GAA repeats 

As discussed above, the elongation stage of the alternative 
replication is severely inhibited by the presence of even 
relatively short GAA repeats. Remarkably, however, the 
same repeats seem to increase the rate of alternative rep- 
lication initiation. This is evident, for example, from the 
fact that the intensity of the replication arc approaching 
the repeat is much higher than that for the control, 
repeat-free plasmid (Figure 7A). Note that about the 
same amount of the plasmid was loaded for 
GAA-containing and control plasmids (Figure 7B). 
Similar increases in the alternative replication efficiency 
of the repeat-containing plasmids were consistently 
observed in all our experiments, which required us to 
load significantly more replication intermediates for the 
control plasmid to equalize them with the intermediates 
of the repeat-containing plasmids (Figure 6B). 

Replication of Epstein-Barr virus-derived episomes is 
mildly affected by the GAA repeats 

One explanation for why only the first replication cycle of 
the pSV-derived episome was inhibited by the GAA 
repeats could be that T-antigen serves as a replicative 
helicase during all subsequent replication cycles. 
T-antigen is a very potent helicase that was shown to 
pass through alternative DNA structures (31,32). To 
evaluate this hypothesis, we decided to analyze the repli- 
cation of GAA repeats in another episomal system driven 
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Figure 7. (A) The first round of replication of the plasmid containing 
GAA repeat is more pronounced than that of a control plasmid. The 
same amount of plasmids Ori(-)CTT57 and Ori(-) were used for 293A 
transfections. The cells were lysed 6h after transfection, replication 
intermediates were isolated and digested with Afllll. The samples 
were intentionally run together to allow direct comparison. (B) The 
samples Ori(-)CTT57 and Ori(-) contain about the same amount of 
plasmid. 



by the Epstein-Barr virus (EBV) replication origin (33). 
In this case, replication is initiated at the OriP origin upon 
binding of the EBNA protein (34), followed by the loading 
of other components of the mammalian replication fork 
(35). Consequently, the MCM2-7 complex, rather than 
T-antigen, serves as the replication helicase here. 

293-EBNA cells were transfected with the 
pRepGAA230 or pRepCTT230 plasmids, which 
harbored the (GAA) 2 3o repeat in two orientations 
(Figure 8A). Replication intermediates were isolated in 6 
days after transfection, and digested by EcoRI and Xbal 
enzymes, which positioned the repeat at one-third of the 
length of the fragment from an OriP-proximal end. Since 
replication intermediates were isolated 6 days post- 
transfection, they contained only a small portion of 
initial bacterial sample (no more than 10% of episomal 
replication intermediates, 20 ng of which were loaded on 
the 2D gel in Figure 8B). An extensive Dpnl digest was 
then performed to eliminate this residual amount of initial 
bacterial DNA. The activity of Dpnl enzyme was 
controlled in Supplementary Figure S3 (Supplementary 
Data). This digest would also eliminate the residual 
products of the first replication cycle, although we never 
observed them 2 days post-transfection anyway (26). An 
extensive Dpnl digest was then performed to make sure 
that we were only analyzing plasmids that replicated in 
mammalian cells more than once. Electrophoretic 
analysis of the replication intermediates revealed the 
presence of a characteristic double Y arc with a bulge 
(Figure 8B, shown by arrows), similar to what was 
observed during the first round of replication of the 
pUCneoGAA35 plasmid. The appearance of the double 
Y arc was indicative of replication being blocked by the 
GAA repeat at least in a fraction of plasmids. It should be 
noted, however, that this blockage was observed in only a 
fraction of molecules in contrast to a near-complete 
blockage of the first replication round by the same 
repeat (Figure 2). The observed pattern of fork stalling 
was not an artifact of large-scale deletions at the 
GAA230 repeat: it stayed intact throughout the 
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Figure 8. (A) The scheme of the pRepGAA/CTT230 plasmid, digested 
with EcoRI and Xbal. (B) Replication of pRepGAA/CTT230 is only 
mildly affected by the repeat. The plasmid replication intermediates 
were isolated from 293-EBNA cells 6 days after transfection, digested 
with EcoRI, Xbal and Dpnl, and analyzed by 2D gel electrophoresis. 
The hybridization probe corresponded to the Xbal-EcoRI fragment 
containing the GAA repeat. The presence of the double Y arcs is in- 
dicative of a partial replication blockage. 



experiment, and there were no significant changes in the 
length of the restriction fragment in question as is evident 
from the lack of shift in the position of the descending part 
of the Y arc. 



DISCUSSION 

This study demonstrates, for the first time, that GAA 
repeats can block the replication fork progression in mam- 
malian cells likely due to the triplex formation. 
Remarkably, different modes of episomal replication in 
mammalian cells were affected differently by GAA 
repeats. We have previously demonstrated that the first 
round of replication of the SV40-derived plasmids in 
mammalian cells could be carried out by the alternative 
replication mode, which does not require the presence of 
the viral replication origin and is initiated at various sites 
in the plasmid (26). Here we report that this replication 
mode is almost completely blocked by the presence of 
GAA repeats. On the other hand, replication of an 
EBV-derived plasmid, initiated by the EBNA protein at 
the OriP origin, is only mildly inhibited by long GAA 
repeats. Finally, T-antigen-driven replication of the 
pSV2-derived plasmids was not affected by the GAA 
repeats after the second replication round. 

Thus, GAA repeats impede the progression of the mam- 
malian replication fork, which is involved in both alterna- 
tive and OriP-initiated replication in mammalian cells, but 
have no effect on the progression of a fork where viral 
T-antigen serves as a replication helicase. This is not 
surprising, given that T-antigen is an exceptionally 
potent DNA helicase, and was shown to unravel various 
unusual DNA structures (31,32). Note, also, that another 
expandable repeat, (GGC) n , stalled the progression of 
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T-antigen-driven forks (36); however, this stalling was less 
pronounced than the stalling observed in yeast (37). We 
believe that the latter stalling resulted from the formation 
of a hairpin-like structure in the lagging strand template 
DNA, i.e. when the replicative helicase has already passed 
the repeat. The formation of such hairpin structures in 
mammalian cells has been recently demonstrated (38). 

A particularly potent stalling of the first episomal rep- 
lication cycle by the GAA repeat is most likely caused by 
its unusual DNA conformation(s). In naked superhelical 
DNA, this repeat can form a triplex (21) or a more 
complex sticky DNA structure, a triplex formed between 
the two separate GAA runs (29). Since alternative repli- 
cation operates on superhelical DNA that just started to 
assemble in chromatin, appearance of these structures 
ahead of the replication fork would inevitably block its 
progression. In the EBV-derived plasmid, the native rep- 
lication fork replicates chromatinized DNA templates. 
The presence of a regular nucleosome structure would 
limit the ability of a GAA repeat to form unusual DNA 
conformations, explaining much more modest replication 
inhibition. 

We believe that the first replication round is stalled so 
profoundly, compared to the subsequent rounds, because 
the inhibitory DNA triplexes pre-exist in plasmids used 
for the transfection. DNA triplexes can easily form in 
superhelical DNA isolated from bacteria in the presence 
of magnesium ions in the transfection mix. To address the 
question whether the GAA repeat can affect replication in 
the lack of the pre-formed triplex, we introduced nicks 
into our plasmid DNA followed by heating it in the 
absence of bivalent cations, which unraveled all 
pre-formed DNA structures (30,39). After transfecting 
mammalian cells with this nicked DNA, we still detected 
replication stalling, although somewhat less pronounced. 
This result suggests that a triplex or some other 
non-canonical structure can form in DNA inside the 
mammalian cells. The latter observation is also consistent 
with the data on the modest inhibition of the fork pro- 
gression by GAA repeats in EBV-derived plasmids. 

While the GAA repeat did not serve as a replication 
origin during alternative DNA replication, as it was 
evident from the lack of a pronounced bubble arc in our 
2D gels, it did increase the efficiency of the first replication 
round, somehow making a plasmid more competent in 
terms of replication initiation. This characteristic of an 
alternative initiation mode was also observed in other 
systems, when a particular sequence made a DNA 
template more prone to replication without being an 
origin (40,41). 

The increase in the efficiency of alternative replication 
could be due to the formation of a non-canonical DNA 
structure by the GAA repeat. Indeed, non-canonical DNA 
structures such as hairpins and intramolecular triplexes 
are accompanied by single-stranded regions. We have pre- 
viously shown that the presence of single-stranded DNA 
induced by superhelicity can promote alternative replica- 
tion initiation (26). However, the subsequent electrophor- 
etic analysis of replication through the Bsal-Ncol DNA 
fragment containing the GAA repeat in its center did not 
reveal a trace of the bubble arc, which rules out that the 



repeat per se serves as a replication initiation point. We 
hypothesize, therefore, that a single-stranded DNA part of 
the intramolecular triplex might serve as a loading dock 
for the replication fork components that would then 
initiate replication elsewhere. 

What are the implications of our findings for the FRDA 
pathogenesis? It is generally believed that expansions of 
GAA repeats occur in early embryonic divisions during 
intergenerational transmissions in FRDA families (12). 
Recently, it was also demonstrated that these repeats 
expand with an almost 100% probability upon transcrip- 
tion factor-mediated reprogramming of fibroblasts from 
FRDA patients (42,43). Pluripotent cells' chromatin con- 
figuration is more open than that of differentiated cells 
(44). The state of chromatin in mammalian zygote is 
understudied; however, the major portion of chromatin 
in sea urchin zygote is in unfolded nucleosome structure, 
which mostly consists of non-regular positioned nucleo- 
somes (45). These data hint that open chromatin structure 
may facilitate GAA repeat expansions. In our case, repli- 
cation stalling is much more pronounced in the lack of 
regular chromatin in episomal DNA, as it makes triplex 
formation more feasible. This consideration makes it 
tempting to speculate that abnormal replication of long 
(GAA)n repeats in the context of open chromatin could 
contribute to their expansions. Note, however, that the 
analogy between the chromatin structure of our 
episomes and the chromatin of the mammalian zygote is 
not substantiated by the experimental data and remains 
purely speculative. 

Another possibility comes from recent observations that 
the instability of the FRDA repeat in mammalian 
chromosomes was strongly enhanced by transcription 
through the repetitive region (46,47). It is possible that 
single-stranded DNA segments accumulating upon the 
formation of stable DNA-RNA complexes in the repeat 
could promote alternative replication initiation, which 
would also lead to the repeat's instability. Supporting 
this notion, the replication efficiency was much higher 
for pUCneoGAA57 compared to Ori(-)GAA57 in 293 
cells. Since these cells lack T-antigen, transcription from 
a promoter adjacent to the SV40 origin could contribute 
to the initiation of alternative DNA replication. 

Finally, we have previously shown that the alternative 
replication mode was also detected outside of the S-phase 
of the cell cycle, synchronized at the Gl/S border (26). 
This indicates that the alternative replication mode 
could be associated with cellular DNA repair. In fact, 
repair can be stimulated by the presence of alternative 
DNA conformations including those formed by GAA 
repeats (48). Instability of the GAA repeat was also 
observed in post-mitotic cells (16), which could be 
attributed to alternative replication/repair process. 

In summary, we found, for the first time, that different 
modes of replication in mammalian cells are affected dif- 
ferently by the expandable GAA repeats causative of 
FRDA. This result might help to explain the differences 
in the rates and scales of repeat instability between 
somatic and embryonic cell divisions characteristic for 
this disease. 
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